Team members
- Michele Claibourn
- Michael Salgueiro
- Chase Dawson
- Jacob Goldstein-Greenwood
- Jordan House
- Khalila Karefa-Kargbo
- Lee LeBoeuf
- Marisa Lemma
- Helena Lindsay
- Tolu Odukoya
Data Collection Process
Throughout the summer and fall of 2021, we’ve been creating and refining a replicable data collection process to build a data collection resource for use in our collective work and by anyone in the community. The process is intended to make our work transparent, to provide resources for other to use, and to be highly automated for easier updates.

Beginning with needs and requests for additional information and data from within our coalition, and those articulated by additional community partners, we
- Begin researching available sources, seeking to undertand the provenance and genesis of the data (e.g., collected via surveys, captured from satellite imagery, derived from models build around station monitors, etc.), the temporality and spatial granularity (how frequently is it updated, what areas is it available for or could it be aggreated to), and the available variables and measures within the initial sources.
- Create code to acquire the data from source, working to remove as many manual steps as possible; to process the data, filtering to our region, checking data quality, deriving additional measures from included variables, aggregating to administrative boundaries for integration with demographic and population data. Output includes replication code and a csv file of the resulting data.
- Create code to build a documentation file identifying the source of the data, providing variable definitions, and generating initial visualization of the key metrics. Output include replicate code and a web document providing more details about the data source.
This winter and spring, we will continue to add to the current data collections, clean up and refine our work to date, and begin (4) to integrate the data sources for further analysis and visualization
Newly Available Data Collections
To build on the the population data we’ve collected as part of the broader Equity Atlas Prototype (e.g., demographic, economic, health, and other social data) and Shelter in Place measures (e.g., food, car, broadband access), the table below provides an overview of the data collections, including motivating questions, key measures, and data sources. The table can be filtered for key topics (climate measures, risk factors, community assets and infrastructure, transportation). In the next stage, we’ll begin to merge these measures with previously compiled data to visualize relationships between residents and resources.
Topic icon attribution
Data Examples
CDC Places: Health Outcomes
These data come from the Center for Disease Control and Prevention (CDC) PLACES: Local Data for Better Health. The data provide model-based estimates for chronic disease risk factors, health outcomes, and clinical prevention services. According the the website, “CDC uses an innovative peer-reviewed multilevel regression and poststratification (MPR) approach that links geocoded health surveys and high spatial resolution population demographic and socioeconomic data.”
Asthma
- Adjusted percent of survey respondents aged >= 18 who reported that they had ever been told by a health professional that they have asthma and that they still have asthma.
Coronary Heart Disease
- Adjusted percent of survey respondents aged >= 18 who reported that they had ever been told by a health professional that they had angina or coronary hearth disease.
Cancer (excluding skin cancer)
- Adjusted percent of survey respondents aged >= 18 who reported that they had ever been told by a health professional that they have any type of cancer except skin cancer. This variable is not specific to any type of cancer.
Diabetes
- Adjusted percent of survey respondents aged >= 18 who reported that they had ever been told by a health professional that they had diabetes (excluding diabetes during pregnancy).
Obesity
- Adjusted percent of survey respondents aged >= 18 who reported that they have a BMI >= 30kg/m^2. This indicator is calculated by the CDC from self-reported weight and height, excluding pregnant women. Self-reported height and weight tend to lead to lower BMI estimates than clinical height and weight measurements.
Employment and Commute Patterns
The Origin-Destination Employment Statistics (LODES) pairs the census blocks for where employees live and work, allowing us to estimate outcomes like the the number of low-wage jobs, the median commuting distance of residents in the Eastern Shore region, and the percent of workers in the region who live elsewhere.
Percent of Low-Wage Jobs
Proportion of low-wage earning jobs (earnings $1250/month or less).
Percent of Non-Residential Workers
HMDA: Home Mortgage Applications
These data come from the Consumer Financial Protection Bureau and Federal Financial Institutions Examinations Council. All data were collected as part of the Home Mortgage Disclosure Act (HMDA). The data shown below include only home purchase loans, meaning that mortgages for home improvement or refinancing are excluded.
Applications by Tract, 2020